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Figure 1 




Cloning of the endothelial cell tropic and leukotropic clinical 
isolate VR1814 as FIX-BAC 




Fig. 1 Construction of FIX-BAC. A gpt-tk-BAC cassette was introduced into the 
US2-6 region of the parental virus VR1814 as described in patent application 
PCT/EP02/01 867 (WO 02/066629). 
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Figure 2 



RVFIX virus mutants 

RVFIXAULb' 

RVFIXAUL132-128 

RVFIXAUL1 33-148 

RVFIXAUL132-130 

RVFIXAUL131K 

RVFIXAUL130 

RVFIXAUL128K 

RVFIXAUL132K 

RVFIXAUL148 

RVFIXAUL146-147 

RVFIXAUL127 



Deletion according to (Chee, Bankier et al., 1990) 
nt 175662-189347 
nt 174857-176865 
nt 177896-189347 
nt 175662-177743 
nt 176377-176776 
nt 175662-176314 
nt 174953-175387 
nt 177076-177752 
nt 177896-178644 
nt 179069-179966 
nt 174467-174770 



Fig. 2 List of RVFIX virus 
mutants and determination of 
the mutation (kanamycin 
cassette insertion) according 
to annotation by {Chee, 
Bankier, et al. 1990 269 /id} 



Figure 3 shows an agarse gel and Southern Blot of RVFIX and RVFIX mutant 

viruses. 
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Fig.3 DNA derived from individually grown FIX-BAC clones (lane 1) or FIX-BAC 
mutant clones AULb', AUL130-132, AUL132, AUL131, AUL130, AUL130K, 
AUL128 and AUL128K (lanes 2-9) was digested with Hind\\\ and run on a 0.5% 
agarose gel. / 
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Figure 4 

A 



FIX 176834 -GTCTGCAACATGCGGCTGTGTCGGGTGTGGCTGTCTGTTTGTCTGTGCGCCGTGGTGCTG 

RACE_95_3 GTCTGCAACATGCGGCTGTCTCGGGTGTGGCTGTCTGTTTGTCTGTGCGCCGTGGTGCTG 

RACE_95_8 GTCTGCAACATGCGGCTGTGTCGGGTGTGGCTGTCTGTTTGTCTGTGCGCCGTGGTGCTG 

RACE_9 5_1 1 GTCTGCAACATGCGGCTGTGTCGGGTGTGGCTGTCTGTTTGTCTGTGCGCCGTGGTGCTG 

FIX GGTCAGTGCCAGCGGGAGACCGCAGAAAAAAA--CGATTATTACCGAGTACCGCATTACT 

RACE_9 5_3 GGTC AGTGCCAGCGGGAGACCGCAGAAAAAAA- -CGATTATTACCGAGTACCGCATTACT 

RACE_9 5_8 GGTC AGTGCCAGCGGGAGACCGCAGAAAAAAA- -CGATTATTACCGAGTACCGCATTACT 

RACE_9 5_1 1 GGTCAGTGCCAGCGGGAGACCGCAGAAAAAAAAACGATTATTACCGAGTACCGCATTACT 

FIX GGGACGCGTGCTCTCGCGCGCTGCCTGACCAAACCCGTTACAAGTATGTGGAACAGCTCG 

RACE_95_3 GGGACGCGTGCTCTCGCGCGCTGCCTGACCAAACCCGTTACAAGTATGTGGAACAGCTCG 

RACE_95_8 GGGACGCGTGCTCTCGCGCGCTGCCTGACCAAACCCGTTACAAGTATGTGGAACAGCTCG 

RACE_9 5_1 1 GGGACGCGTGCTCTCGCGCGCTGCCTGACCAAACCCGTTACAAGTATGTGGAACAGCTCG 

FIX TGGACCTCACGTTGAACTACCACTACGATGCGAGCCACGGCTTGGACAACTTTGACGTGC 

RACE_9 5_3 TGGACCTCACGTTGAACTACCACTACGATGCGAGCCACGGCTTGGACAACTTTGACGTGC 

RACE_95_8 TGGACCTCACGTTGAACTACCACTACGATGCGAGCCACGGCTTGGACAACTTTGACGTGC 

RACE_9 5_1 1 TGGACCTCACGTTGAACTACC ACTACGATGCGAGCC ACGGCTTGGACAACTTTG ACGTGC 

FIX TCAAGAGGTGAGGGTACGCGCTAAAGGTGTATGACAACGGGAAGGTAAGGGCGAACGGGT 

RACE_9 5_3 TC AAG AG 

RACE_95_8 TCAAGAG^GAGGGTACGCGCTAAAGGTGTATGACAACGGGAAGGTAAGGGCGAACGGGT 

RACE_9 5_1 1 TC AAGAG ' 

FI x AACGGGTAGGTAACCGCATGGGGTGTGAAATGACGTTCGGAACCTGTGCTTGC^AATCA 

RACE_9 5_3 ^AATCA 

RACE_9 5_8 AACGGGTAGGTAACCGCATGGGGTGTGAAATGACGTTCGGAACCTGTGCTTGC^GAATCA 

RACE_95_11 AATCA 

FIX ACGTGACCGAGGTGTCGTTGCTCATCAGCGACTTTAGACGTCAGAACCGTCGCGGCGGCA 

RACE_95_3 ACGTGACCGAGGTGTCGTTGCTCATCAGCGACTTTATACGTCAGAACCGTCGCGGCGGCA 

RACE_95_8 ACGTGACCGAGGTGTCGTTGCTCATCAGCGACTTTAGACGTCAGAACCGTCGCGGCGGCA 

RACE_9 5_1 1 ACGTGACCGAGGTGTCGTTGCTCATCAGCGACTTTAGACGTCAGAACCGTCGCGGCGGCA 

FIX CCAACAAAAGGACCACGTTCAACGCCGCCGGTTCGCTGGCGCCTCACGCCCGGAGCCTCG 

RACE_9 5_3 CCAACAAAAGGACCACGTTCAACGCCGCCGGTTCGCTGGCGCCTCACGCCCGGAGCCTCG 

RACE_9 5_8 CCAACAAAAGGACCACGTTCAACGCCGCCGGTTCGCTGGCGCCTCACGCCCGGAGCCTCG 

RACE_9 5_1 1 CCAACAAAAGGACCACGTTCAACGCCGCCGGTTCGCtGGCGCCTCACGCCCGGAGCCTCG 

FIX AGTTCAGCGTGCGGCTCTTTGCCAACTAGCCTGCGTCA - 1 76346 

RACE_95_3 AGTTCAGCGTGCGGCTCTTTGCCAACTAGCCTGCGTCA 

RACE_9 5_8 AGTTCAGCGTGCGGCTCTTTGCCAACTAGCCTGCGTCA 

RACE_9 5_1 1 AGTTCAGCGTGCGGCTCTTTGCCAACTAGCCTGCGTCA 

B 

FIX 175631 -CCGCGCGTCATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTATTG 

RACE_9 5_3 CCGTGCGTCATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTATTG 

RACE_95_8 CCGCGCGTCATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTATTG 

RACE_9 5_1 3 CCGCGCGTCATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTATTG 

FIX7 GGTCACAGCCGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTCAAC 

RACE_9 5_3 GGTCACAGCCGCGTGCCGCGGGTACGCGCAGAAGAATGTTGGGAATTC ATAAACGTCAAC 

RACE_9 5_8 GGTCACAGCCGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTC ATAAACGTCAAC 

RACE_9 5_1 1 GGTCACAGCCGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTCAAC 
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FIX CACCCGCCGGAACGCTGTTACGATTTCAAAATGTGCAATCGCTTCACCGTCGCGTACGTA 

RACE_9 5_3 CACCCGCCGGAACGCTGTTACGATTTCAAAATGTGCAATCGCTTC ACCGTCGC^ACGTA 

RACE_95_8 CACCCGCCGGAACGCTGTTACGATTTCAAAATGTGCAATCGCTTCACCGTCGCGTACGTA 

RACE.9 5_1 1 CACCCGCCGGAACGCTGTTACGATTTCAAAATGTGCAATCGCTTCACCGTCGC . 



FIX TTTTTATGATTGTCTGCGTTCTGTGGTGCGTCTGGATTTGTCTCTCGACGTTTCTGATAG 

RACE_95_3 TTTTCATGATTGTCTGCGTTCTGTGGTGCGTCTGGATCTGTCTCTCGACGTTTCTGATAG 

RACE_95_8 TTTTCATGATTGTCTGCGTTCTGTGGTGCGTCTGGATCTGTCTCTCGACGTTTCTGATAG 

RACE_95_11 

fix ccatgttccatcgacgatcctcgggaatgccagagtagattttcatgaatccacaSgctg 

race 95 3 ccatgttccatcgacgatcctcgggaatgccagagtagattttcatgaatccacXggctg 

RACE_95_8 ccatgttccatcgacgatcctcgggaatgccagagtagattttcatgaatccacaggctg 

RACE_95_11 — — -GCTG 

FIX CGGTGTCCGGACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGGGATC ■ 

RACE_95_3 CGGTGTCCGGACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGGGATC 

RACE_95_8 CGGTGTC CGG ACGGCG AAGTCTGCT AC AGTCC CG AG AAAAC GG C TG AG ATT CG CGGG ATC 

RACE_9 5_1 1 CGGTGTCCGGACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGGGATC 

FIX GTCACCACCATGACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAGCTGC 

RACE_9 5_3 GTCACCACCATGACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAACTGC 

RACE_95_8 GTCACCACCATGACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAGCTGC 

RACE_9 5_1 1 GTCACCACCATGACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAGCTGC 

FIX AACTACAATCCgTAAGTCTCTTCCTCGAGGGCCTTACAGCCTATGGGAAAGTAAGACAGA 

RACE_95_3 AACTACAATCC 

RACE_9 5_8 AACTACAATCC 

RACE_9 5_1 1 AACTACAATCT . 

FIX GGGACAAAACATCATTAAAAAAAAAGTCTAATTTCACGTTTTGTACCCCCCCTTCCCCTC 

RACE_95_3 

RACE_95_8 : 

RACE_95_11 

FIX CGTGTTGT^GTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAACGACA 

RACE_9 5_3 GTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAACGACA 

RACE_95_8 GTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAACGACA 

RACE_9 5_1 1 GTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAACGACA 

FIX AGGCGCAGTACCTGCTGGGCGCCGCTGGCAGCGTTCCCTATCGATGGATCAACCTGGAAT 

RACE_95_3 AGGCGCAGTACCTGCTGGGCGCCGCTGGCAGCGTTCCCTATCGATGGATCAACCTGGAAT 

RACE_95_8 AGGCGCAGTACCTGCTGGGCGCCGCTGGCGGCGTTCCCTATCGATGGATCAACCTGGAAT 

RACE_95_U AGGCGCAGTACCTGCTGGGCGCCGCTGGCAGCGTTCCCTATCGATGGATCAACCTGGAAT 

FIX ACGACAAGATAACCCGGATCGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAACACA 

RACE_95_3 ACGACAAGATAACCCGGATCGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAACACA 

RACE_9 5_8 ACGACAAGATAGCCCGGATCGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAACACA . 

RACE_9 5_1 1 ACGACAAGATAACCCGGATCGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAATACA 

) 

FIX AACGGCTGGATGTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGAATAATAAA- 1 74887 ■ 

RACE_9 5_3 AACGGCTGGATGTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGAATAATAAA 

RACE_95_8 AACGGCTGGATGTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGAATAATAAA 

RACE_9 5_1 1 AACGGCTGGATGTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGAATAATAAA 

C 



FIX 17489 2 -CGCTAAAATGGGCTATATGCTGCAGTGAA^^AA^TGTGTGTTTGTCCGA- 1 7 4 84 i 
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_ CGCTAAAATGGGCTATATGCTGCAGTGAATKAT^gATGTGTGTTTGTCCGCAAAAAAAA 

CGCTAAAATGGGCTATATGCTGCAGT AAT^p^ATGTGTGTTTGTCCAAAAAAAAAA - 
» CGCTAAAATGGGCTATATGCTGCAGTGAAT ^^^ ATGTGTGTTTGTCCAAAAAAAAAA - 

Fig. 4 UL131-128 mRNA processing - Panels (A-C) compare FIX-BAC DNA 
sequence (numbered according to Chee et al.) to a set of cDNA sequences from 
RACE clones 95-3, 95-8 and 95-11 (A) UL131 region, (B) UL128 region, (C) 
UL131-128 transcripts 3' end. Start codons, stop codons and the polyA site are 
in bold face, mRNA processing signals (splice donor sequence, splice acceptor 
sequence, AATAAA signal) are grey-shaded. 



RACE_95_3 
RACE_95_8 
RACE_95_11 



Figure 5 
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176834-GTCTGCAA 



jS:' ;• J< ; jS — — — — ■ ii m mi l 



PAGCC TGC GTC AC GGGAAATAAT 



H * mmm i < m m mm m m m <w§g«; 




&KG'@iSASG@(S*i^OrJSS^©A(' i ■ : RTTGAGCCCGTCGCGCGCGCAGGGAA 



g£|AC GT ATTTTC ATGATTGTC TGC GTTC TGTGGTGC GTC TGG ATTTGTC TC TC G AC GT 
Y V F S 

ttctgatagccatgttccatcgacgatcctcgggaatgccagagtagattttcatgaatccacSg! 



hGThagTCTC 



TAAGACAGAGGGACAAAACATCATTAAAAAAAAAGTCTAATTTCACGTTTTGTACCCCCCCTTCCCCTCCGTGTTGTM 




Fig. J Exon-intron organization of the FIX-BAC UL131-128 genetic locus. UL131 
(green); UL130 (orange); UL128 (blue); UL128xl C-terminus (light blue). 




128x2 




128x1 





130 — Ji31x2h— 131x1 




Translation of SEQKlon95-3.txt 

l ah 



61 
21 



iai 

61 



241 
81 



301 
101 



361 
121 



HCK-1 (pUL131) 



MR L S R V W L S ,V C LCAVVLOQC 

CAGCGGGAGACCGCAGAAAAAAACGA TTATTACCQAGTA CCGCATTACTGGGACGCGTGC 
ORETAEKNDYYRVPHYWDAC 

TCTCGCOCGCTGOCTGACCAAACCCaTTACAAGTATOTOGAA CAGCTOQTQQACCTCACO 
SRAL PDQTRYKYVBQLVDLT 

TTOAACTACCACTACmTCCaAGCCACQGCTTGGACAACTTTO 
LNY HYDASHO LDNFDVLKR I 



AACQTaACCGAGGlVlXAJl u l^C J XCATCAGCGAiJ'l u I u iATACGTCAGAA( 
NVTEVSLLI SDFIRQ 



TNK RTTFNA 



O S L A P 




GAOTTCAOCGTGCGGCTCTTTOCCAACTAO 
EFSVRLFAN* 




pUL131 



128x2 




128x1 





130 











131x2 





1 

131x1« 

i 




Tranalation of SEQKlon95-8.txt HCK"2 (pUL131x1) 



pUL131x1 



61 
21 



181 
61 



ATCCGGCTCTGTCGGOTGTGGCTaTCT 
HRLCRVW. LSVC LCAVVLOQC- 

CAGCOGGAGACCGCAGAAAAAAACOATTATTACCGAOTACCGCATTACT^^ 
ORETAEKNDYYRVPHYWDA C 

ACTOGTTACAAGTATOTGGAACAGCTCGTGaACCTCACO 
QTRYKYVEQL VDLT 



S R A L P D 



TTGAACTACCACrTACX^TGCGAGCCACGGCTTGGACAACTTTG^ 
LNYHYDASHGLDNFDVLKR* 



Fig. 6 Scheme of the differentially spliced transcripts of the UL1 31-128 region. Upper 
panel RACE clone 95-3 and predicted open reading frame (orf) pUL131 (HCK-1). Lower 

panel RACE clone 95-8 and predicted orf UL131xl (HCK-2). 




Translation of SEQ128 . txt ( 3r563 I HCK-4 (pUL128) 



61 
21 



121 
41 



181 
61 



241 
81 



301 
101 



M S 



GCGTTGTGGCTGCTAT3ISGTCACAGC 
NLTPPLTALW. LLLGHS 



CGCGTGCC GCGGGTAC GCGC AGAAGAATGTTGCGAATTCATAAAC GTCAACC ACCCGCCG 
RV P.RV RA E EC CE F INVNH P P 

GAACGCTGTTACGATTT<AAAATGTGCAATCGCTTC 
ERCYDF KMCNRPTVALRCPD 

GGCGAAGTCTGCTACAGTCCCGAGAAACGGCTGAGATTCGCGGGATCGTCACCACCATG 
GEVC YS PEKTAEIRG IVTTM 

ACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAGCTGCAACTACAATCTG 
THSLTRQVVHNKLTSCNYNL 

TTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAACGACAAGGCGCAGTAC 
LYL EADGR IRC GKVNDKAQY 





128x2 
















128x2 




128x1 





pUL128 



361 
121 



CTGCTGGGCGC CGCTGGCAGCGTTCCCTATCGATGGATCAAC CTGGAATACGACAAGATA 
LLGAAGSVPY RWINLEYDKI 



421 
141 



ACCCGGATCGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAACACAAACGGCTGGAT, 
TRIVGLDQYLESVKKHKRLD 



481 
161 



GTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGA 
VCRAKMGYMLQ * 



Translation of SEQ128 x ltxt HCK-3 (pUL128x1) 



61 
21 



121 
41 



MS PKNLTPFLTA LWLLLGHS 

CGCGTCCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTCAACCACCCGCCG 
RVPRVRAE EC CE P I NVNH PP 

GAACGCTGTTACGATTTCAAAATGTGCAATCGCTTCACCGTCGCGTACGTATTTTCATGA 
ERCYDFKMC NRPTVAYVFS * 



128x2 




128x2 




128x1 







pUL128x1 
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Fig. 7 Scheme of the differentially spliced transcripts of the UL131-128 region. 
Upper panel SEQUL128B and predicted open reading frame (orf) pUL128 (HCK- 
4). Lower panel SEQUL128A and predicted orf UL128x1 (HCK-3). 



Figure 8 



Northern Blot Analyses 



RVFIX, RVFIX mutants and laboratory strains: 



1 ^ ^ ^ <§> 
<lr 4? 4? 4? 4? <S? 




Fig. 8 mRNA was prepared from RVFIX-infected fibroblasts 4 days p.i. using 
Rneasy Mini, QIAshredder and Oligotex mRNA Mini kits according to the 
manufacturer's guidance (Qiagen). For Notherh blotting, 1 /jg RNA was 
electrophoresed on an agarose gel according to the MOPS-formaldehyde 
protocol and blotted onto Hybond N+ membranes (Amersham Pharmacia). Blots 
were hybridized with a UL1 31-128 specific probe. 
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Figure 9 

Comparison RACE clone 95-3 - FIX genomic sequence 



Upper line: SEQFIX UL131-128.txt, from 10 to 1977 
Lower line: SEQKIon95-3.txt, from 1 to 1741 



SEQFIX UL131-128.txt: SEQKlon95-3.txt identity= 99 . 66% (1735/1741) 
gap=11.94%(236/1977) 



1 GTCTGCAACATGCGGCTGTGTCGGGTGTGGCTGTCTGTTTGTCTGTGCGCCGTGGfGCTG 

III MM III II I Illllll Mill II III II III II II II II 1 1 II II I 

1 . " ATGCGGC TGTC TC GGGTGTGGC TGTC TGTTTGTCTGTGCGCCGTGGTGCTG 

61 GGTC AGTGC C AGC GGGAG AC C GC AGAAAAAAAC GATT ATTAC C G AGT AC C GC ATT AC TGG 

I II III I III Ml I II II INI II II MM MM III MINI Mill I Mill I Ml 1 1 

52 GGTC AGTGCC AGCGGGAGAC CGC AGAAAAAAAC GATTATTAC CGAGTAC C GC ATTAC TGG 
121 GACGCGTGCTCTCGCGCGCTGCCTGACCAAACCCGTTACAAGTATGTGGAACAGCTCGTG 

1 1 1 II 1 1 M I M M 1 1 1 1 II II II 1 1 M 1 1 II 1 1 1 II II 1 1 1 1 1 II II 1 1 II 1 1 1 1 M I M 

112 GACGCGTGCTCTCGCGCGCTGCCTGACCAAACCCGTTACAAGTATGTGGAACAGCTCGTG 
181 GACCTCACGTTGAACTACCACTACGATGCGAGCCACGGCTTGGACAACTTTGACGTGCTC 

MINI !■ MM MINIM MM MINIM I M MINI 'I Mill II Mill 

172 GAC C TC AC GTTGAAC T AC C AC T AC GATGC GAGC C AC GGC TTGG AC AAC TTTGAC GTGCTC 

241 AAGAGGTGAGGGTACGCGCTAAAGGTGTATGACAACGGGAAGGTAAGGGCGAACGGGTAA 
Mill 

232 AAGAG 

3 01 CGGGTAGGTAACCGCATGGGGTGTGAAATGACGTTCGGAACCTGTGCTTGCAGAATCAAC 

Illllll 

235 AATCAAC 

361 GTGAC CGAGGTGTC GTTGC TC ATC AGCGACTTTAGACGTC AGAAC C GTCGCGGC GGC AC C 

1 1 1 II 1 1 1 II M II 1 1 1 1 II II II II I II I II 1 1 II 1 1 1 IN M II M I II M 1 1 II N 

244 GTGACCGAGGTGTCGTTGCTCATCAGCGACTTTATACGTCAGAACCGTCGCGGCGGCACC 
421 AAC AAAAGGACCACGTTC AACGCCGCCGGTTCGCTGGCGCCTCACGCCCGGAGCCTCGAG 

1 1 1 M M II I II M 1 1 II II M II 1 1 1 1 1 II II II 1 1 1 II I II II I II I II I M I II II I 

3 04 AACAAAAGGACCACGTTCAACGCCGCCGGTTCGCTGGCGCCTCACGCCCGGAGCCTCGAG 
481 TTCAGCGTGCGGCTCTTTGCC AACTAGCCTGCGTCACGGGAAATAATATGCTACGGCTTC 

I Mill I MM II I II II MM II II Nil III II II Illllll MM MM II II MM 

3 64 TTC AGCGTGCGGCTCTTTGCC AACTAGCCTGCGTCACGGGAAATAATATGCTACGGCTTC 
541 TGCTTCGTCACCACTTTCACTGCCTGCTTCTGTGCGCGGTTTGGGCAACGCCCTGTCTGG 

I II MIMM M M M MINI IMIIMI Mill M I IMM M N 1 1 IN II II N N 

424 TGCTTCGTC ACC ACTTTC ACTGCCTGCTTCTGTGCGCGGTTTGGGC AACGCCCTGTCTGG 
601 CGTCTCCGTGGTTC ACGCTAACGGCGAACC AGAATCCGTCCCCGCC ATGGTCTAAACTGA 

M 1 1 1 1 II I II II I II M M 1 1 1 II M II II 1 1 1 II II II 1 1 1 1 1 II I M II I II II M I 
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484 CGTCTCCGTGGTTCACGCTAACGGCGAACCAGAATCCGTCCCCGCCATGGTCTAAACTGA 
661 GGTATCCCAAACCGCATGACGCGGCGACGTTTTACTGTCCTTTTCTCTATCCCTCGCCCC 

I II MM III I MM Ml I III I IMIMI II III II II MINI II I M I III MM 1 1 

544 CGTATCCCAAACCGCATGACGCGGCGACGTTTTACTGTCCTTTTCTCTATCCCTCGCCCC 

» 

721 CACGGTCCCCCTCGCAATTCCCGGGGTTCCAGCGGGTATCAACGGGTCCCGAGTGTCGCA 

I II II II III 1 1 Ml III I III I II I MM II II I II II Mill I II M M Ml IMM I 

604 CACGGTCCCCCTCGCAATTCCCGGGGTTCCAGCGGGTATCAACGGGTCCCGAGTGTCGCA 
781 ACG AGAC C C TGT ATC TGC TGTAC AAC CGGG AAGGC C AG ACC TTGGTGG AG AGAAGC TC C A 

I II II II MM MM III MM I III MM II II I II IMIMI III M MIIMMM I 

664 AC GAGACC CTGTATC TGCTGTAC AAC C GGGAAGGC C AGACCTTGGTGGAGAGAAGC TCC A 
841 CCTGGGTGAAAAAGGTGATCTGGTATCTGAGCGGTCGCAATCAGACCATCCTCCAACGGA 

I II II 1 1 MM 1 1 II I M MM MM MM M 1 1 1 MM I MM 1 1 1 1 1 1 1 Ml I IMM 

724 CCTGGGTGAAAAAGGTGAT'CTGGTATCTGAGCGGTCGCAATC AGACC ATCCTCCAACGGA 
901 TGC CC C GAACGGC TTCGAAAC C GAGC GACGGAAACGTGC AGATC AGC GTGGAAGAC GCC A 

I M II M M II M M II I M II II I II M 1 1 1 1 II I II I II II II II II 1 1 II I II I II I 

784 TGC C C CG AAC GGC TTC GAAAC C GAGC G ACGGAAAC GTGC AG ATC AGC GTGGAAG ACGC C A 
961 AGATTTTTGG AGC GC AC ATGGTGC CCAAGC AGACC AAGC TGCTAC GTTTC GTC GTC AAC G 

MMMMMIMMMMMMMMMMMMMMMMMMMMMI IMM 

844 AGATTTTTGGAGCGC AC ATGGTGC CCAAGC AGACC AAGC TGCTACGTTTC GTC GCC AAC G 
1021 ATGGCACACGTTATCAGATGTGTGTGATGAAACTGGAGAGCTGGGCCCACGTCTTCCGGG 

I II II II MM MM MM II 1 1 Ml IMIMIM IMIMI II II 1 1 1 1 1 M II III 1 1 

904 ATGGC AC ACGTTATC AGATGTGTGTGATGAAAC TGGAGAGC TGGGCC C AC GTC TTCC GGG 
1081 ACTACAGCGTGTCTTTTCAGGTGCGATTGACGTTCACCGAGGCCAATAACCAGACTTACA 

1 1 III I II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml II 1 1 1 M II 1 1 1 1 1 II 1 1 1 1 1 M 1 1 II 1 1 1 

964 ACT AC AGC GTGTCTTTTC AGGTGCGATTGACGTTC AC C GAGGC C AATAAC C AGAC TTAC A 
1141 CCTTCTGCACCCATCCCAATCTCATCGTTTGAGCCCGTCGCGCGCGC AGGGAATTTTGAA 

1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1024 CCTTCTCCACCC ATGCCAATC TCATCGTTTGAGCCCGTCGCGCGCGCAGGGAATTTTGAA 
1201 AACCGCGCGTC ATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTAT 

IMM I II II I III II 1 1 III MM IMIMI 1 1 II II II MM M 1 1 1 1 Ml llllll 

1084 AACCGTGCGTCATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTAT 
1261 TGGGTCAC AGCCGCGTGCCGCGGGTACGCGC AGAAGAATGTTGC.GAATTCATAAACGTCA 

I M Ml 1 1 III I II 1 1 II I MM III MM II II I II Mill II II 1 1 1 1 II 1 1 1 1 1 II I 

1144 TGGGTCAC AGCCGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTC ATAAACGTC A 
1321 ACCACCCGCCGGAACGCTGTTACGATTTCAAAATGTGC AATCGCTTC ACCGTCGCGTACG 

I M III I II II II M I M II II I Ml MM MM I Ml II II II I II II M IMIMIM 

1204 AC C AC CC GCC GG AAC GC TGTTAC GATTTC AAAATGTGC AATC GC TTC AC C GTC GC GTAC G 
1381 TATTTTC ATGATTGTC TGC GTTC TGTGGTGCGTC TGGATTTGTCTC TC GAC GTTTCTGAT 

1 1 1 M II II II I II I II 1 1 II II I i 1 1 II I II M I II M M M M M M MM M M M I 

12 64 ' TATTTTC ATGATTGTCTGCGTTCTGTGGTGCGTCTGGATCTGTCTCTCGACGTTTCTGAT 
1441 AGC C ATGTTC C ATC GAC GATCC TCGGGAATGC C AGAGTAGATTTTC ATGAATCC AC AGGC 

MM II 1 1 III 1 1 1 1 1 1 II 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II II Mill 

1324 AGCCATGTTCC ATCGACGATCCTCGGGAATGCC AGAGTAGATTTTC ATGAATCC ACAGGC 
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1501 TGCGGTGTCCGGACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGGGA 

MM I III II II III MM III MINIM II III II II I MM III I III III II I II I 

1384 TGC GGTGTCC GG AC GGC G AAGTC TGC T AC AGTCC C GAGAAAAC GGC TG AG ATTC GC GGGA . 
1561 TCGTCACCACCATGACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAGCT 

1 1 1 1 1 1 1 ! ( 1 1 1 1 1 1 1 1 [ I i j 1 1 ! 1 1 1 1 1 1 1 1 1 E ! 1 1 1 1 1 1 1 [ E I ! i I ( 1 1 1 1 1 1 i 1 II 

1444 TCGTCACCACCATGACCC ATTCATTGACACGCC AGGTCGTACACAAC AAACTGACGAACT 

1621 GC AACTAC AATC CGTAAGTC TC TTC C TCGAGGGCCTTAC AGCCTATGGGAAAGTAAGAC A 
1 I I I I I f I I I I I I 

1504 GCAACTACAATCC 

1681 GAGGGACAAAAC ATC ATTAAAAAAAAAGTCTAATTTC ACGTTTTGTACCCCCCCTTCCCC 

1517 

1741 TC CGTGTTGTAGGTTATAC C TC GAAGC TGAC GGGCGAATAC GCTGCGGC AAAGTGAAC GA 

II 1 1 II 1 1 II MMMMIMM MM 1 1 II 1 1 ! 1 1 1 1 M 1 1 1 1 1! II 

1517 GTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGC AAAGTGAACGA 

1801 CAAGGCGCAGTACCTGCTGGGCGCCGCTGGCAGCGTTCCCTATCGATGGATCAACCTGGA 

1 1 1 1 1 II I II II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M 1 1 II 1 1 1 II 1 1 1 M 1 1 1 1 II 1 1 M II I II I 

1565 C AAGGC GC AGT ACC TGC TGGGCGCC GC TGGC AGC GTTC C C T ATC GATGG ATC AAC CTGGA 
1861 AT AC GAC AAGATAAC C C GGATC GTGGGCC TGGATC AGTAC C TGGAGAGC GTTAAG AAAC A 

MM M M II II M 1 1 1 1 1 1 1 1 1 II 1 1 1 II M Ml 1 1 Ml II M M 1 1 II 1 1 II I II II I 

1625 AT AC GAC AAGATAAC C CGGATCGTGGGCCTGGATC AGTAC CTGGA.GAGC GTTAAGAAAC A 

1921 • CAAACGGCTGGATGTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGAATAATAAA 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1685 CAAACGGCTGGATGTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGAATAATAAA . 



Translation of SEQKIon95-3.txt: HCK-1 (pUL1 31 ) 

1 ATGC GGC TGTC TCGGGTGTGGCTGTC TGTTTGTCTGTGC GCCGTGGTGCTGGGTC AGTGC 
1 MRLSRVWLSVCLCA V VLGQ .C 

6 1 c agcgggagacc gc agaaaaaaacgattattac cgagtacc gc attac tgggacgcgtgc 
21 qretae k|dyy .rvph y wdac 

121 tctcgcgcgctgcctgacc aaacccgttacaagtatgtggaacagctcgtggacctcacg 
41 sralpdqtryk yveqlvdlt 

181 ttgaac tac c ac tac gatgc gagcc acggcttggac aactttg acgtgc tc aagagaatc 
61 l|yhyda shgld|fdvlkri 

241 aacgtgaccgaggtgtcgttgctcatcagcgactttatacgtcagaaccgtcgcggcggc 

81 . fVTEVSLLI SDF'IRQ.gRRGG 

301 AC C AAC AAAAGGAC C AC GTTC AAC GCC GCCGGTTC GC TGGC GCC TC AC GC C C GGAGCC TC 

ioi t|krt tf|aag slapharsl 



361 
121 



GAGTTC AGC GTGC GGC TC TTTGCC AAC TAG 
EFSVRLFAN* 
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Figure 10 



Comparison RACE clone 95-8 -FIX genomic sequence 

Upper line: SEQFIX UL131-128.txt, from 10 to 1977 
Lower line: SEQKIon95-8.txt, from 1 to 1849 



SEQFIX UL131-128 . txt : SEQKlon95-8.txt identity= 99 . 78% ( 1845/1849 ) 
gap=6.47%(128/1977) 



1 - GTCTGCAACATGCGGCTGTGTCGGGTGTGGCTGTCTGTTTGTCTGTGCGCCGTGGTGCTG 

I II MM I II I II I IIMIII Ml III I Ml II MM I M I MM II I II I 

1 ATGCGGC TGTGTC GGGTGTGGC TGTCTGTTTGTC TGTGC GC C GTGGTGCTG 

61 GGTC AGTGCC AGCGGGAGAC C GC AGAAAAAAACGATTATTACC GAGTAC C GC ATTAC TGG 

I Ml III I II II MM Ml I II I IIMIII Ml III I Mill M 1 1 M I Ml I MM M I 

5 2 GGTC AGTGCCAGCGGGAGACCGCAGAAAAAAACGATTATTACCGAGTACCGCATTACTGG 
121 GACGCGTGCTCTCGCGCGCTGCCTGACCAAACCCGTTACAAGTATGTGGAACAGCTCGTG 

MINI M II II M II 1 1 II II I MMIIIMI II M Mill M 1 1 1 1 MM I IIMIII 

112 GACGCGTGCTCTCGCGCGCTGCCTGACC AAACCCGTTACAAGTATGTGGAACAGCTCGTG 
181 GACCTC ACGTTGAACTACCACTACGATGCGAGCCACGGCTTGGAC AACTTTGACGTGCTC 

M M M 1 1 1 1 1 M 1 1 1 1 1 M M I i M I M 1 1 M M M M M I M M M M M M M M M 

172 GACCTC AC GTTGAACTACC AC TACGATGCGAGCCACGGCTTGGAC AACTTTGACGTGCTC 
241 AAGAGGTGAGGGTAC GCGC TAAAGGTGTATGAC AACGGGAAGGTAAGGGC GAAC GGGTAA 

MM Ml I II M II II M II M II 1 1 IMIIMI I II Ml M II 1 1 1 1 1 III II II I M I 

232 AAGAGGTGAGGGTAC GC GC TAAAGGTGTATGAC AACGGGAAGGTAAGGGC GAAC GGGTAA 
301 C GGGTAGGTAAC CGC ATGGGGTGTG AAATGACGTTC GGAAC C TGTGC TTGC AGAATC AAC 

MM MM II II M II II II II I IIMIII IIMIII MM III II 1 1 1 Ml M II I M I 

292 CGGGTAGGTAACCGC ATGGGGTGTGAAATGACGTTCGGAACCTGTGCTTGCAGAATCAAC 
361 GTGACCGAGGTGTCGTTGCTCATCAGCGACTTTAGACGTCAGAACCGTCGCGGCGGCACC 

1 1 II III I M II II II M M II II IMIIMM III I Ml II II M M I Ml Ml 1 1 II I 

352 GTGACCGAGGTGTCGTTGCTCATCAGCGACTTTAGACGTCAGAACCGTCGCGGCGGC ACC 
421 AACAAAAGGACCACGTTCAACGCCGCCGGTTCGCTGGCGCCTCACGCCCGGAGCCTCGAG 

M M i II 1 1 1 M M II 1 1 1 1 1 1 1 1 1 M M M M M 1 1 II 1 1 1 1 1 1 1 M M M M 1 1 1 Ml 

412 AACAAAAGGACCACGTTCAACGCCGCCGGTTCGCTGGCGCCTCACGCCCGGAGCCTCGAG 
481 TTCAGCGTGCGGCTCTTTGCCAACTAGCCTGCGTCACGGGAAATAATATGCTACGGCTTC 

I M II II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 

472 TTC AGC GTGC GGCTC TTTGC C AAC T AGC CTGC GTC AC GGG AAAT AATATGC T AC GGC TTC 
541 TGCTTCGTCACCACTTTCACTGCCTGCTTCTGTGCGCGGTTTGGGCAACGCCCTGTCTGG 

MM III I M M M II II II II I MIMIMM MM MUM M I M M M M 1 1 MM 

532 TGCTTCGTCACCACTTTCACTGCCTGCTTCTGTGCGCGGTTTGGGCAACGCCCTGTCTGG 
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601 CGTCTCCGTGGTTCACGCTAACGGCGAACCAGAATCCGTCCCCGCCATGGTCTAAACTGA 

I II I Ml I II II II 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II I 

592 CGTCTCCGTGGTTC ACGCTAACGGCGAACC AGAATCCGTCCCCGCCATGGTCTAAACTGA 
661 CGTATCCCAAACCGCATGACGCGGCGACGTTTTACTGTCCTTTTCTCTATCCCTCGCCCC 

MM IMIMI llllll llllll MMMIMMMMMIMI ill MMIMMIM 

652 CGTATCCCAAACCGCATGACGCGGCGACGTTTTACTGTCCTTTTCTCTATCCCTCGCCCC 
721 CACGGTCCCCCTCGCAATTCCCGGGGTTCCAGCGGGTATCAACGGGTCCCGAGTGTCGCA 

Ml 1 1 II Ml II Ml I M llllll 1 1 MM II MM I II 1 1 MM III I II 1 1 Ml III I 

712 CACGGTCCCCCTCGCAATTCCCGGGGTTCCAGCGGGTATCAACGGGTCCCGAGTGTCGCA 
781 ACGAGACCCTGTATCTGCTGTACAACCGGGAAGGCCAGACCTTGGTGGAGAGAAGCTCCA 

I II II Mill II II MINIM II Ml III II I II MM I Mill III II II I lllllll 

772 ACGAGACCCTGTATCTGCTGTACAACCGGGAAGGCCAGACCTTGGTGGAGAGAAGCTCCA 
841 CCTGGGTGAAAAAGGTGATCTGGTATCTGAGCGGTCGCAATCAGACCATCCTCCAACGGA 

I MM lllllll MM II II INI III IIIMI III I III I Ml I III Mill lllllll 

832 CCTGGGTGAAAAAGGTGATCTGGTATCTGAGCGGTCGCAATCAGACCATCCtCCAACGGA 
901 TGCCCCGAACGGCTTCGAAACCGAGCGACGGAAACGTGCAGATCAGCGTGGAAGACGCCA 

. I II 1 1 IN 1 1 II I II I II 1 1 INI 1 1 II 1 1 II MM I Ml I MM III 1 1 II Mil III I 

892 TGCCCCGAACGGCTTCGAAACCGAGCGACGGAAACGTGCAGATCAGCGTGGAAGACGCCA 

961 AGATTTTTGGAGCGCACATGGTGCCCAAGCAGACCAAGCTGCTACGTTTCGTCGTCAACG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I : 

952 AGATTTTTGGAGCGCACATGGTGCCCAAGCAGACCAAGCTGCTACGTTTCGTCGTCAACG 

1021 ATGGC AC ACGTTATCAGATGTGTGTGATGAAACTGGAGAGCTGGGCCC ACGTCTTCCGGG 

1 1 1 1 1 1 i i I i I ! 1 1 1 1 1 1 1 1 1 i I IE 1 1 M I M 1 1 1 1 ! 1 1 

1012 ATGGC AC AC GTT ATC AG ATGTGTGTG ATG AAAC TGG AGAGC TGGGCC C ACGTC TTC C GGG 
1081 ACTACAGCGTGTCTTTTCAGGTGCGATTGACGTTCACCGAGGCCAATAACCiSGACTTACA 

III 1 1 II III II I II 1 1 II Mill I III 1 1 II I III Ml I Mil I II Mil Mil III I 

1072 ACTACAGCGTGTCTTTTC AGGTGCGATTGACGTTCACCGAGGCCGATAACCAGACTTACA 
1141 CCTTCTGCACCCATCCCAATCTCATCGTTTGAGCCCGTCGCGCGCGCAGGGAATTTTGAA 

Mil I lllllll MM llllllllllllllllllll Mill III I III I III MM MM 

1132 CCTTCTGCACCCATCCCAATCTCATCGTTTGAGCCCGTCGCGCGCGCAGGGAATTTTGAA 
1201 AACCGCGCGTCATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTAT 

II I II M I M 1 1 1 1 M M II II II I M II II II II II II I II M I II 1 1 1 1 II II II M I 

1192 AACCGCGCGTCATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTAT 
1261 TGGGTCACAGCCGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTCA 

II 1 1 1 1 II I II 1 1 II M I M M 1 1 II I II 1 1 II 1 1 1 II I II II 1 1 II 1 1 M II I II I II I 

1252 TGlGGTCACAGCCGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTCA 
1321 ACCACCCGCCGGAACGCTGTTACGATTTCAAAATGTGCAATCGCTTCACCGTCGCGTACG 

I II I MM II II llllll llllll II llllll MM I MM MM MM MM I II MM 

1312 ACCACCCGCCGGAACGCTGTTACGATTTCAAAATGTGCAATCGCTTCACCGTCGCGTACG 
1381 TATTTTCATGATTGTCTGCGTTCTGTGGTGCGTCTGGATTTGTCTCTCGACGTTTCTGAT 

III II II Mill llllll IIIMI Ml Ml II MM III llllllllllllllllllll 

1372 TATTTTCATGATTGTCTGCGTTCTGTGGTGCGTCTGGATCTGTCTCTCGACGTTTCTGAT 
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1441 AGCCATGTTCCATCGACGATCCTCGGGAATGCCAGAGTAGATTTTCATGAATCCACAGGC 

I IIIIMIIHIMIIIII MM MINIMI! MM MM INI MM II IIIIIMII 

1432 AGCC ATGTTCCATCGACGATCCTCGGGAATGCCAGAGTAGATTTTCATGAATCCAC AGGC 
1501 TGCGGTGTCCGGACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGGGA 

1 1 ! 1 1 1 j 1 1 J ! 1 1 1 1 [ l E l j 1 1 1 1 1 1 J 1 1 j 1 1 1 1 1 1 J r 1 1 1 1 1 1 1 i J 1 1 1 1 1 

1492 TGCGGTGTCCGGACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGGGA 

1561 TCGTCACCACCATGACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAGCT 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1552 TCGTCACC ACCATGACCCATTCATTGACACGCCAGGTCGTACAC AACAAACTGACGAGCT 

1621 GC AAC TAC AATC C GT AAGTC TC TTC C TCGAGGGC C TT AC AGC C TATGGGAAAGTAAGAC A 
I I I I I I I I I I I I I 

1612 GC AAC TAC AATC C 

1681 G AGGGAC AAAAC ATC ATT AAAAAAAAAGTC T AATTTC AC GTTTTGT AC C GC CC C TTC C C C 

1625 

1741 TCCGTGTTGTAGGTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAACGA 

1 1! ! II I II 1 1 M M 1 1 i ! 1 1 1 1 1 M 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 M > 1 1 

1625 GTTATAC C TC GAAGC TGACGGGC GAATAC GC TGCGGC AAAGTGAAC GA 

1801 CAAGGCGCAGTACCTGCTGGGCGCCGCTGGCAGCGTTCCCTATCGATGGATC AACCTGGA 

II II I MM II 1 1 1 1 II II II M I MIMM IMMIMI Illlllll MIIMMII I 

1673 CAAGGCGCAGTACCTGCTGGGCGCCGCTGGCGGCGTTCCCTATCGATGGATCAACCTGGA 
1861 ATAC GAC AAGATAAC C C GGATC GTGGGC CTGGATC AGTAC C TGGAGAGC GTTAAGAAAC A 

MM Mill MM I I.I II III I II Mill III I III I II I III I III I ill II MM II I 

1733 ATAC GAC AAGAT AGC CC GGATC GTGGGC CTGGATC AGTAC C TGGAGAGC GTTAAGAAAC A 
1921 C AAACGGCTGGATGTGTGCCGC GC TAAAATGGGCTATATGC TGC AGTGAATAATAAA 

1 1 II 1 1 M II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I II 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 II M I II I 

1793 C AAAC GGC TGGATGTGTGC CGC GC T AAAATGGGC T AT ATGC TGC AGTGAATAATAAA 



Translation ofSEQKIon95-8.txt: HCK-2 (pUL131x1) 



1 ' ATGC GGCTGTGTC GGGTGTGGCTGTCTGTTTGTC TGTGC GC C GTGGTGC TGGGTC AGTGC 

1 M RLCRVWL.S V CLCA VVLGQC 

6 1 c agc gggagacc gc agaaaaaaac gattattac cgagtacc gc attactgggacgc gtgc 

21 qre taekIdyyrvphywdac 

121 tctcgcgcgctgcctgaccaaacccgttacaagtatgtggaacagctcgtggacctcacg 

41 s ralpdqtryky.veqlvdlt 

181 ttgaactaccactacgatgcgagccacggcttggacaactttgacgtgctcaagaggtga 

61 l|yh ydashgld|fdvlkr* 
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Figure 11 

Comparison RACE clone 95-11 -FIX genomic sequence 



Upper line: SEQFIX UL131-128.txt, from 10 to 1977 
Lower line: SEQKIon95-11.txt, from 1 to 1620 



SEQFIX UL131-128.txt: SEQKlon95-ll.txt identity= 99 . 57% (1611/1618) 
gap=18.24%(361/1979) 



1 GTC TGC AAC ATGC GGC TGTGTC GGGTGTGGC TGTC TGTTTGTC TGTGC GC C GTGGTGC TG 

1 1 1 1 1 1 1 II I Ml I II 1 1 1 1 MM II II II II II I ! 1 1 1 1 1 1 1 1 1 1 1 II 1 1 

1 ATGCGGC TGTGTC GGGTGTGGC TGTC TGTTTGTC TGTGC GC CGTGGTGCTG 

6 1 GGTCAGTGCCAGCGGGAGACCGCAG . . AAAAAAAC GATTATT AC C GAGT AC CGC ATTAC T 

1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 M MM II II I! II IM I M M M I Ml II I M 

52 GGTC AGTGCC AGC GGGAGAC CGC AGAAAAAAAAAC GATTATTAC CGAGTAC CGC ATTAC T 
119 GGGACGCGTGCTCTCGCGCGCTGCCTGACCAAACCCGTTACAAGTATGTGGAACAGCTCG 

II I II I M MM I M I II I II 1 1 II MM MIMI II II II M I II II M I II II M M I 

112 GGGACGCGTGCTCTCGCGCGCTGCCTGACCAAACCCGTTACAAGTATGTGGAACAGCTCG 

179 TGGACCTCACGTTGAACTACCACTACGATGCGAGCCACGGCTTGGACAACTTTGACGTGC 

I II M II I I II I I II I I I I I I I I I II M II II II II I I II I I II I I I I I II I I I I II II I 
172 TGGAC CTC AC GTTGAACT ACC ACTACGATGC GAGC C ACGGC TTGGAC AAC TTTGAC GTGC 

239 TCAAGAGGTGAGGGTACGCGCTAAAGGTGTATGACAACGGGAAGGTAAGGGCGAACGGGT 

MMMI 

232 TCAAGAG 

299 AACGGGTAGGTAACCGCATGGGGTGTGAAATGACGTTCGGAACCTGTGCTTGCAGAATCA 

MMI 

237 , AATCA 

359 ACGTGACCGAGGTGTCGTTGCTC ATCAGCGACTTTAGACGTCAGAACCGTCGCGGCGGC A 

. 1 1 M M M M 1 1 1 1 1 1 II M M Ml I M I II 1 1 1 III II II II 1 1 Mi I III Ml II II I 

244 AC GTGACC GAGGTGTC GTTGC TC ATC AGC GACTTT AGAC GTC AGAACC GTCGC GGC GGC A 
419 CCAACAAAAGGACCACGTTCAACGCCGCCGGTTCGCTGGCGCCTCACGCCCGGAGCCTCG 

MM 1 1 II II M I II I Mill 1 1 Illllll II MMI MM II 1 1 III 1 1 Ml MMMI 

304 CCAAC AAAAGGACCACGTTCAACGCCGCCGGTTCGCTGGCGCCTCACGCCCGGAGCCTCG 
479 AGTTC AGCGTGCGGCTCTTTGCCAACTAGCCTGCGTC ACGGGAAATAATATGCTACGGCT 

MMMMMMMM M MMMMMMMMMMMMMMMMMMMMM 

3 64 AGTTC AGC GTGC GGC TCTTTGCC AAC TAGCCTGCGTC ACGGGAAATAATATGCTACGGCT 
539 TCTGCTTCGTC ACC ACTTTCACTGCCTGCTTCTGTGCGCGGTTTGGGC AACGCCCTGTCT 
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MM II II MM II II II MM I Ml II II II II I II 1 1 1 1 II I Ml 1 1 1 1 II 1 1 1 1 1 1 1 

424 TCTGCTTCGTCACCACTTTCACTGCCTGCTTCTGTGCGCGGTTTGGGCAACGCCCTGTCT 
599 GGCGTCTCCGTGGTTCACGCTAACGGCGAACCAGAATCCGTCCCCGCCATGGTCTAAACT 

IMIII II III I II II II MM MIMI IMIIIIII MM III I III I II II III III I 

484 GGCGTCTCCGTGGTTCACGCTAACGGCGAACCAGAATCCGTCCCCGCCATGGTCTAAACT 
659 G AC GT ATC C C AAAC C GC AT G ACGC GGC G AC GTTTT AC TGTCCTTTTCTC T ATC C CTC GC C 

1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 M 1 1 1 1 1 

544 GACGTATCCCAAACCGCATGACGCGGCGACGTTTTACTGTCCTTTTCTCTATCCCTCGCC 
719 CCC ACGGTC C CCC TC GC AATTC C CGGGGTTC C AGC GGGTATC AAC GGGTC CC GAGTGTC G 

I II M I M MM I M I II II II IIMIMM II MM MM 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 

604 CCCACGGTCCCCCTCGCAATTCCCGGGGTTCCAGCGGGTATTAACGGGTCCCGAGTGTCG 
779 CAACGAGACCCTGTATCTGCTGTACAACCGGGAAGGCC AGACCTTGGTGGAGAGAAGCTC 

I MM I II MM I II I II II III I II II II III II II II II III I III II I II III III I 

664 CAACGAGACCCTGTATCTGCTGTAC AACCGGGAAGGCCAGACCTTGGTGGAGAGAAGCTC 
839 C AC C TGGGTG AAAAAGGTG ATCTGGTATC TGAGC GGTC GC AATC AG AC CATC C TC C AAC G 

IMIII II II I M 1 1 1 II M M M I II II I MIMI 1 1 II IM M 11 1 1 1 II IM I II I 

724 CAGCTGGGTGAAAAAGGTGATCTGGCATCTGAGCGGTCGCAATCAGACG ATCCTCCAACG 
899 GATGCC CC GAACGGC TTC GAAAC CGAGCGACGGAAAC GTGC AGATC AGCGTGGAAGACGC 

I MM I M Mill II I M 1 1 III III Mill II II II II 1 1 III 1 1 1 1 1 II Mill III I 

784 GATGCC CC GAACGGC TTC GAAAC CGAGCGAC GGAAAC GTGC AGATC AGC GTGGAAGACGC 
959 C AAGATTTTTGG AGC GC AC ATGGTGC C C AAGC AGACC AAGCTGC T AC GTTTC GTC GTC AA 

MIMI lilllll III MIIMI II IMIII II II I! IMIIMMM Mill III Ml 

844 CAAGATTTTTGGAGCGCACATGGTGCCCAAGCAGACCAAGCTGCTACGTTTCGTCGCCAA 
1019 C G ATGGC AC AC GTT ATC AGATGTGTGTGATG AAACTGGAGAGC TGGGC C C AC GTCTTC C G 

Mill III MM MM Mill II II IMIIIIII II MM III I II III I Mill III I 

904 CGATGGC AC AC GTTATTAG ATGTGTGTGATGAAAC TGGAGAGC TGGGC CC ACGTC TTC C G 
1079 "GGACTACAGCGTGTCTTTTCAGGTGCGATTGACGTTCACCGAGGCCAATAAC.CAGACTTA 

Mill III Mill IMIMIIIIIIIIMIIII II II IMMM I II II IMIII I II I 

964 GGACTACAGCGTGTCTTTTCAGGTGCGATTGACGTTCACCGAGGCCAATAACC AGACTTA 
1139 CACCTTCTGCACCCATCCCAATCTCATCGTTTGAGCCCGTCGCGCGCGCAGGGAATTTTG 

1 1 1 1 1 II M 1 1 M 1 1 1 II M 1 1 III M I II III II II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II M 

1024 CACCTTCTGCACCCATCCCAATCTCATCGTTTGAGCCCGTCGCGCGCGCAGGGAATTTTG 
1199 AAAACCGCGCGTCATGAGTCCC AAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCT 

I MM II I MM I M 1 1 1 1 M 1 1 1 1 1 MM III II II 1 1 1 MM I II I Ml M Ml I II I 

1084 AAAACCGCGCGTCATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCT 
1259 ATTGGGTC AC AGC C GC GTGC C GC GGGTAC GCGC AGAAGAATGTTGCGAATTC AT AAAC GT 

1 1 III 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II M M 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 

11 4 4 ATTGGGTC ACAGCCGCGTGCCGCGGGTACGCGC AGAAGAATGTTGCGAATTCATAAACGT 
1319 CAACCACCCGCCGGAACGCTGTTACGATTTCAAAATGTGCAATCGCTTCACCGTCGCGTA 

I II 1 1 II I II 1 1 Ml I II I M M I II M Mill II IMIII I III M 1 1 M I MM I 

1204 C AAC C ACC CGCC GGAAC GC TGTT AC GATTTC AAAATGTGC AATC GC TTC ACC GTC GC . , 
137 9 CGTATTTTC ATGATTGTC TGC GTTC TGTGGTGC GTC TGGATTTGTCTC TC GAC GTTTC TG 
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1262 

1439 ATAGC C ATGTTC C ATCGAC GATCC TCGGGAATGC C AGAGTAGATTTTC ATGAATCC AC AG 

1262 

1499 GCTGCGGTGTCCGGACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGG 

MM I M I II MM II II Ml MM I III I! I! II I MM II I II M 1 1 II II II 1 1 1 1 

1262 ACTGCGGTGTCCGGACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGG 
1559 GATCGTC ACC AC C ATGAC C C ATTC ATTGAC AC GC C AGGTC GTAC AC AAC AAACTGAC GAG 

Mill MMIM IMIIMI 1 1 1 1 M MM IMMM II Ml II II M II MM Mill I 

1321 GATCGTC ACC AC C ATGAC CC ATTC ATTGAC AC GC C AGGTC GTAC AC AAC AAAC TGAC GAG 
1619 C TGC AAC T AC AATC C GT AAGTC TC TTC CTC G AGGGC C TT AC AGC C T AT GGGAAAGT AAG A 

Ml MM IMMM 

13 81 CTGCAACTACAATCT 

1679 CAGAGGGACAAAACATCATTAAAAAAAAAGTCTAATTTCACGTTTTGTACCCCCCCTTCC 

1396 

1739 CCTCCGTGTTGTAGGTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAAC 

II 1 1 1 1 M 1 1 i I II I MM II I II 1 1 1 II 1 1 II 1 1 MM I II III 

1396 GTTATAC C TC GAAGCTGAC GGGC GAATACGCTGC GGC AAAGTGAAC 

1799 GAC AAGGC GC AGTAC CTGC TGGGC GC C GC TGGC AGC GTTC C C TATCGATGGATC AACC TG 

1 1 II 1 1 MMM III II 1 1 MM I II MM II Mill II II I IN III II II II II MM 

1442 GAC AAGGCGCAGTACCTGCTGGGCGCCGCTGGCAGCGTTCCCTATCGATGGATCAACCTG 
1859 GAATACGACAAGATAACCCGGATCGTGGGCCTGGATC AGTACCTGGAGAGCGTTAAGAAA 

I MM I IIMIMM I II 1 1 1 II Ml MM MM III I Ml MMIIMI MMM I Ml 

1502 GAATACGACAAGATAACCCGGATCGTGGGCCTGGATC AGTACCTGGAGAGCGTTAAGAAA 
1919 CACAAACGGCTGGATGTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGAATAATAAA 

II III MM M MMM 1 1 1 II III III I MM M I Ml 1 1 M 1 1 M 1 1 MMM I II 

1562 T AC AAAC GGC TGGATGTGTGC C GC GC T AAAATGGGC T AT ATGC T GC AGTGAATAAT AAA 
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Figure 12 



Comparison SEQ 128 B - FIX genomic sequence 



Upper line: FIX genomic sequence 
Lower line: SEQ 128 B 



5998 ATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTATTGGGTCACAGC 

1 1! I II II II I II 1 1 1 1 II III MM III 1 1 II III III 1 1 Ml 1 1 II III I MM II 1 1 

1 ATGAGTCC CAAAAAC C TGAC GC C GTTC TTG ACGGC GTTGTGGC TGC T ATTGGGTC AC AGC 

6058 CGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTCAACCACCCGCCG 

I M Ml III 1 1 II 1 1 II II I M IIMIMI MMM Ml II MM I II Ml I M II II 1 1 

6 1 C GC GTGC C GC GGGT AC GC GC AGAAGAATGTTGC GAATTC AT AAAC GTC AAC C AC C C GC C G 
6118 GAACGCTGTTACGATTTCAAAATGTGCAATCGCTTCACCGTCGCGTACGTATTTTCATGA 

Ml III INI MM Ml 1 1 II! IIIIIIIMIII! MM III! I Ml! Ill I MM INI 

121 GAACGCTGTTACGATTTCAAAATGTGCAATCGCTTCACCGTCGCGTACGTATTTTC ATGA 
617 8 TTGTCTGCGTTCTGTGGTGCGTCTGGATCTGTCTCTCGACGTTTCTGATAGCCATGTTCC 

I II III Mil III I MM Mil Mill III Mill Mil MINI III IMIIMMM I 

181 TTGTCTGCGTTCTGTGGTGCGTCTGGATCTGTCTCTCGACGTTTCTGATAGCCATGTTCC 
623 8 ATCGACGATCCTCGGGAATGCCAGAGTAGATTTTCATGAATCCACAGGCTGCGGTGTCCG 

I II 1 1 M II 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 II II I II 1 1 II II I M 1 1 1 1 1 1 1 1 M I II I 

241 ATCGACGATCCTCGGGAATGCC AGAGTAGATTTTCATGAATCCACAGG'CTGCGGTGTCCG 
6298 GACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGGGATCGTCACCACC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 mi ii 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

3 01 GAC GGC GAAGTC TGCTAC AGTC CC GAGAAAACGGC TGAGATTC GC GGGATC GTC AC C AC C 
6358 ATGACC C ATTC ATTGAC AC GCC AGGTC GTAC AC AAC AAACTGAC GAGC TGC AAC TAC AAT 

Ml Mill II I II 1 1 M 1 1 III IIMIMI III MIMI II MM III MM II M MM 

361 ATGACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAGCTGCAACTACAAT 

6418 CCGTAAGTCTCTTCCTCGAGGGCCTTAC AGCC'TAT'GGGAAAGTAAGAC AGAGGGACAAAA 

II - 

421 CC . 

647 8 CATCATTAAAAAAAAAGTCTAATTTCACGTTTTGTACCCCCCCTTCCCCTCCGTGTTGTA 
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6538 GGTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGC AAAGTGAACGAC AAGGCGC AGT 

II II II I II MM II II MM I MM lllllllll I II I Ml II II I II I IMIIII I! 

423 . GTTAT AC CTCGAAGCTGACGGGCGAATACGCTGCGGC AAAGTGAACGAC AAGGCGC AGT 
6598 ACCTGCTGGGCGCCGCTGGCGGCGTTCCCTATCGATGGATCAACCTGGAATACGACAAGA 

III MM INI MINIMUM Mil llllllllllll Mil MM MM III MINI 

482 ACC TGC TGGGC GC C GC TGGCGGC GTTC C CT ATC G ATGGATC AAC C TGG AATAC G AC AAG A 
6658 TAGCCCGGATCGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAACACAAXCGGCTGG 

Ml II I M 1 1 M M I M M I M I M 1 1 1 1 M M II 1 1 II M I M 1 1 1 1 1 M M 1 1 M M i 

542 TAGC C C GGATC GTGGGC C TGGATC AGT AC C TGGAGAGC GTTAAG AAAC AC AAAC GGC TGG 
6718 ATGTGTGC C GC GC T AAAATGGGC TAT ATGCTGC AGTG AATAAT AAAATGTGTGTTTGTC C 

I II II 1 1 II II 1 1 M 1 1 1 II 1 1 MM II 1 1 III II I II I II II M 1 1 M M I M M II I 

602 ATGTGTGC C GC GC T AAAATGGGC T ATATGC TGC AGTG AATAAT AAAATGTGTGTTTGTC C 
677 8 GAAAT AC GC GTTTTGAG ATTTC TG 

MM II 

662 AAAAAAAAAAAAAAAAAAAAAAAA 



Translation of SEQ128 B x 1.txt: HCK-3 (pUL128x1) 

1 ATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTATTGGGTCACAGC 

l mspk|lt pfltalwlllghs 

61 CGiCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTCAACCACCCGCCG 

21 R VP RVRAE EC C EF I § V | H P P 

121 GAACGCTGTTACGATTTCAAAATGTGCAATCGCTTCACCGTCGCGTACGTATTTTCATGA 

41 E RCYD FKMCj§RFTVAYVFS* 
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Figure 13 

Comparison SEQ 128 A - FIX genomic sequence 

Upper line: FIX-BAC 

Lower line: SEQ128 A 

5998 ATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTATTGGGTCACAGC 

IIMMIIMIIIMMMMIMI I Illllllllllll! 1 1 III III UN MM I III 

1 ATGAGTCC C AAAAACC TGAC GC C GTTCTTGAC GGC GTTGTGGCTGC TATTGGGTC AC AGC 

6058 C GC GTGCC GC GGGT AC GC GC AGAAGAATGTTGC G AATTC AT AAACGTC AAC C AC C C GC C G 

IHIMMMIMMIIIIMMIMMMMMMMMMMMMMMMMIIM 

61 CGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTCAACCACCCGCCG 
6118 GAAC GC TGTTAC GATTTC AAAATGTGC AATC GC TTC ACC GTCGC GTAC GT ATTTTTATGA 

MMIIMMMIM IN MIMIMM! iMMMMM Ml! 

121 GAACGCTGTTACGATTTCAAAATGTGCAATCGCTTCACCGTCGC 

6178 TTGTCTGCGTTCTGTGGTGCGTCTGGATTTGTCTCTCGACGTTTCTGATAGCCATGTTCC 

166 • • 

6238 ATCGACGATCCTCGGGAATGCCAGAGTAGATTTTC ATGAATCCACAGGCTGCGGTGTCCG 

Illllllllllll 

166 * * GCTGCGGTGTCCG 

6298 GACGGCGAAGTC TGC TAC AGTCC CGAGAAAAC GGCTGAGATTC GC GGGATCGTC AC C ACC 

IIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMMIIIIIIIIIMIIM 

178 GACGGC GAAGTC TGC TAC AGTCC CGAGAAAACGGC TGAGATTCGCGGGATCGTC AC C AC C 
6358 ATGACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAGCTGCAACTACAAT 

II 1 1 M M 1 1 M 1 1 1 1 II 1 1 II II M 1 1 1 1 ! 1 1 M 1 1 1 1 1 ! II 1 1 II 1 1 1 1 1 1 1 1 1 M I 

238 ATGACCCATTCATTGAC ACGCCAGGTCGTAC ACAAC AAAC TGAC G AGC TGC AACTACAAT 

6418 CCGTAAGTCTCTTCCTCGAGGGCCTTAC AGCCTATGGGAAAGTAAGAC AGAGGGAC AAAA 
II 

298 CC 

6478 C ATC ATTAAAAAAAAAGTC TAATTTC AC GTTTTGTAC CC C CC CTTC CC CTC C GTGTTGTA 

300 • 

6538 GGTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGC AAAGTGAACGAC AAGGCGC AGT 

IIMMrMMMI IN Mill MM III MINIM II II II II MM Mill I MM 

3 00 . GTT AT AC C TC GAAGC TGAC GGGC GAAT ACGC TGC GGC AAAGTGAAC G AC AAGGC GC AGT 
6598 ACC TGC TGGGC GC C GC TGGC AGC GTTC C C T ATC G ATGG ATC AAC CTGGAAT AC G AC AAG A 

MMMM Ml I II II IMI Illllllllllll MIMMIMII II MM Ml Mill 
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359 ACC TGC TGGGC GC C GC TGGC AGC GTTC C C T ATC GATGG ATC AAC C TGG AATAC GAC AAG A 
6658 TAACC C GGATC GTGGGC C TGGATC AGTAC C TGGAG AGC GTT AAG AAAC AC AAAC GGC TGG 

1 1 [ 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 ! I E E 1 1 1 1 1 1 1 1 1 1 1 

419 TAACCCGGATCGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAACACAAACGGCTGG 
6718 ATGTGTGCCGCGCTAAAATGGGCTATATGCTGC AGTGAATAATAAAATGTGTGTTTGTCC 

i ii ii ii i inn iii i ii ii i ii i iii i iii i iii i ii inn 1 1 iii mi iii i iii 

479 ATGTGTGCCGCGCTAAAATGGGCTATATGCTGC AGTGAATAATAAAATGTG 

Translation of SEQ128 A: HCK-4 (pUL128) 



1 ATGAGTC CC AAAAAC C TGACGC C GTTC TTGAC GGC GTTGTGGC TGC T ATTGGGTC AC AGC 
1 MSPKgLTPFLT'AL'WLLL.GH. s 

(■ 

61 c gcgtgccgc gggtacgc gc agaagaatgttgc gaattc ataaac gtc aac c ac ccgc cg 
21 rvprvraeec'cefi§v§hpp 

121 gaac gc tgttac g atttc aaaatg tgc aatc gc ttc ac c gtc gc actgc ggtgtc c gg ac 
41 ercydfkmc§rftvalrcp d 

181 ggc gaagtc tgc tac agtc c cgagaaac ggc tg ag attcgc ggg atc gtc ac c ac c atg 
61 gev c ys p ektaeirgivttm 

241 acccattcattgacacgcc aggtcgtacacaac aaactgacgagctgcaactacaatctg 
81 thsltrqvvh|kltsc|y|l 

301 ttatacctcgaagctgacgggcgaatacgctgcggcaaagtgaacgacaaggcgcagtac 
101 ly l eadg. rircgkvgdkaqy 

361 ctgctgggcgc cgctggc agc gttccctatc gatggatc aacc tggaatac gac aagata 
121 l lg aagsvpyrwi|leydki 

421 acccggatcgtgggcctggatcagtacctggagagcgttaagaaacacaaacggctggat 
141 t r.ivgldqylesvkkhkrld 

481 gtgtgc cgc gc t aaaatgggc tatatgc tgc agtga 

161 VCRAKMGYMLQ*. 
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Figure 14 



Translation of SEQUL130: HCK-5 (pUL130) 



1 ATGC TACGGC TTCTGC TTC GTC ACC ACTTTC AC TGC CTGC TTC TGTGC GC GGTTTGGGC A 

.1 MLRLL. LRHHF HCLLLCAVWA 

6 1 . ACGCCCTGTCTGGCGTCTCCGTGGTTCACGCTAACGGCGAACCAGAATCCGTCCCCGCCA 

21 T PC LASPWFTLTANQNP S P P 

121 TGGTCTAAACTGACGTATCCCAAACCGCATGACGCGGCGACGTTTTACTGTCCTTTTCTC 

41 WSKLTYPKPHDAA.T.FYCPFL 

181 TATCC C TC GC CC C C AC GGTCC CC CTCGC AATTC CCGGGGTTC C AGCGGGTATC AAC GGGT 

61 YPSPPRSPSQFPGFQRVSTG 

241 C C C GAGTGTC GC AAC GAG AC C C TGT ATC T GC TGTAC AACC GGG AAGGC C AGAC C TTGGTG 

81 PECR NETLYL LYNREGQT LV 

301 GAGAGAAGCTCC ACCTGGGTGAAAAAGGTGATCTGGTATCTGAGCGGTCGC AATCAGACC 

101 E R S S TWVKKVIW YL S GRNQ T 

361 ATCCTCCAACGGATGCCCCGAACGGCTTCGAAACCGAGCGACGGAAACGTGCAGATCAGC 

121 ILQRMPRTASKPSDGNVQI S 

421 GTGGAAGAC GCC AAGATTTTTGGAGC GC AC ATGGTGC C C AAGC AGAC C AAGC TGC TAC GT 

141 VEDA KI FGAHMVPKQTKLLR 

481 TTCGTCGTCAACGATGGCACACGTTATCAGATGTGTGTGATGAAACTGGAGAGCTGGGCC 

161 FVVND GTRYQMCVMKLESWA 

541 CACGTCTTCCGGGACTACAGCGTGTCTTTTCAGGTGCGATTGACGTTCACCGAGGCCAAT 

181 HVF.RDYSVSF QVRLTFTEAN 



601 AACC AGACTTACACCTTCTGC ACCCATCCCAATCTC ATCGTTTGA 

201 NQTYTFC THPNLIV* 



